Stabilizing and robustifying the learning mechanisms of artificial neural networks in control engineering applications
نویسندگان
چکیده
This paper discusses the stabilizability of artificial neural networks trained by utilizing the gradient information. The method proposed constructs a dynamic model of the conventional update mechanism and derives the stabilizing values of the learning rate. The stability in this context corresponds to the convergence in adjustable parameters of the neural network structure. It is shown that the selection of the learning rate as imposed by the proposed algorithm results in stable training in the sense of Lyapunov. Furthermore, the algorithm devised filters out the high frequency dynamics of the gradient descent method. The excitation of this dynamics typically occurs in the presence of noise and abruptly changing the parameters of the mapping being learned. This adversely influences the learning performance that can be attained during a training cycle. A natural consequence following this excitation is divergence in parameter space. The method analyzed in this paper integrates the gradient descent technique with variable structure systems methodology, which is well known for its robustness to environmental disturbances. In the simulations, control of a three degrees of freedom anthropoid robot is chosen for the evaluation of the performance. For this purpose, a feedforward neural network structure is utilized as the controller. Highly nonlinear dynamics of the plant, existence of a considerable amount of observation noise, and the adverse effects of gravitational forces constitute the difficulties to be alleviated by the neurocontroller trained with the proposed method. In order to come up with a fair comparison, the results obtained with the pure gradient descent technique with the same initial conditions are also presented and discussed. Q 2000 John Wiley & Sons, Inc.
منابع مشابه
Reinforcement Learning in Neural Networks: A Survey
In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...
متن کاملReinforcement Learning in Neural Networks: A Survey
In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...
متن کاملA Stabilizing and Robustifying Training Scheme for Artificial Neural Networks Used in Control of Complex Systems
This paper presents a method for stabilizing and robustifying the artificial neural networks trained by utilizing the gradient descent. The method proposed constructs a dynamic model of the conventional update mechanism and derives the stabilizing values of the learning rate. The stability in this context corresponds to the convergence in adjustable parameters of the neural network structure. I...
متن کاملStabilizing and Robustifying the Error Backpropagation Method in Neurocontrol Applications
This paper discusses the stabilizability of artificial neural networks trained by utilizing the gradient information. The method proposed constructs a dynamic model of the conventional update mechanism and derives the stabilizing values of the learning rate. This is achieved by integrating the Error Backpropagation (EBP) technique with Variable Structure Systems (VSS) methodology, which is well...
متن کاملSimultaneous Monitoring of Multivariate-Attribute Process Mean and Variability Using Artificial Neural Networks
In some statistical process control applications, the quality of the product is characterized by thecombination of both correlated variable and attributes quality characteristics. In this paper, we propose anovel control scheme based on the combination of two multi-layer perceptron neural networks forsimultaneous monitoring of mean vector as well as the covariance matrix in multivariate-attribu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Int. J. Intell. Syst.
دوره 15 شماره
صفحات -
تاریخ انتشار 2000